NPS-55-83-008 



NAVAL POSTGRADUATE SCHOOL 



Monterey, California 



STATIONARY EXPONENTIAL TIME SERIES: 
FURTHER MODEL DEVELOPMENT AND 
A RESIDUAL ANALYSIS 
by 

P. A. W. Lewis 
A. J. Lawrance 



Approved for public release; distribution unlimited 

Prepared for: 

Chief of Naval Research 
Arlington, VA 22217 




April 1983 



FedDocs 
D 208.14/2 
NPS-55-83-008 




7 



Rear Admiral J. J. 
Superintendent 



This work was 
Grant NR-42-284. 



NAVAL POSTGRADUATE SCHOOL 
Monterey, California 



Ekel und 



David A. Schrady 
Provost 



supported in part by the Office of Naval Research under 



Reproduction of all or part of this report is authorized. 



UNCLASSIFIED 



SECURITY CLASSIFICATION OF THIS PAGE (TWl»n Dm Entered) 



DUDLEY KNOX LIBRARY 

NAVAL POSTGRADUA cSt 

MONTEREY CA 939^.3-^10 



REPORT DOCUMENTATION PAGE 


READ INSTRUCTIONS 
BEFORE COMPLETING FORM 


i. REPORT NUMBER - 2. ©OVT ACCESSION NO. 

NPS-55-83-008 


3. RECIPIENT'S CATALOG NUMBER 


4. TITLE (and Subtitle) 

STATIONARY EXPONENTIAL TIME SERIES: FURTHER 

MODEL DEVELOPMENT AND A RESIDUAL ANALYSIS 


5. TYPE OF REPORT A PERIOD COVERED 

Technical 


6. PERFORMING ORG. REPORT NUMBER 


7. AU THOR(a) 

A. J. Lawrance 
P. A. W. Lewis 


8. CONTRACT OR GRANT NUMBERC*) 


9. PERFORMING ORGANIZATION NAME AND ADDRESS 

Naval Postgraduate School 
Monterey, CA 93940 


10. PROGRAM ELEMENT. PROJECT, TASK 
AREA A WORK UNIT NUMBERS 

61153N; RR0 14-0 5-01 
N0001483WR30026 


11. CONTROLLING OFFICE NAME AND ADDRESS 

Chief of Naval Research 
Arlington, VA 22217 


12. REPORT DATE 

April 1983 


IS. NUMBER OF PAGES 

41 


14. MONITORING AGENCY NAME A ADDRESS (If different from Controlling Office) 


IS. SECURITY CL ASS. (ot thle report) 

UNCLASSIFIED 


1 5a. DECLASSIFICATION/ DOWN GRADING 
SCHEDULE 


16 DISTRIBUTION STATEMENT (ot thle Report) 

Approved for public release; distribution unlimited 


17. DISTRIBUTION STATEMENT (of the mbttrmct entered In Block 20, If different from Report) 


18. SUPPLEMENTARY NOTES 


19. KEY WORDS (Continue on reveree elde if neceeeery end identify by block number) 

autoregressive process NEAR(2) process 

exponential variables non-normal processes 

residual analysis 
wind speed data 


20. ABSTRACT (Continue on revere* eld* If neceeeery end identify by block number) 

A second order autoregressive process in exponential variables, NEAR(2), is 
established: the distributional assumptions involved in this model highlight a 

very broad four parameter structure which combines five exponential random 
variables into a sixth exponential random variable. The dependency structure 
of the NEAR(2) process beyond and including autocorrelations is explored using 
some new ideas on residual analysis for non-normal processes with autoregres- 
sive correlation structure. Other applications of the exponential structure 
are considered briefly. These include exponential time series with negative 



DD 



form 
1 JAN 73 



1473 EDITION OF 1 NOV 65 1$ OBSOLETE 

S/N 0102- LF- 014- 6601 



UNCLASSIFIED 



SECURITY CLASSIFICATION OF THIS PAGE (When Dete Entered) 



UNCLASSIFIED 

SECURITY CLASSIFICATION OF THIS PAGE (Whin Dtm Bnfnd) 



correlation and exponential time series with mixed autoregressive-moving 
average structure. An application to the analysis of a set of wind speed 
data is included. 



S/N 0102- LF- 014- 6601 

UNCLASSIFIED 



SECURITY CLASSIFICATION OF THIS PAGEfWian Data Enltred) 



Stationary Exponential Time Series: Further model development 

and a residual analysis 



by 



A. J. Lawrance 
Department of Statistics 
University of Birmingham 
Birmingham, England 



P. A. W. Lewis 
Department of Operations 
Naval Postgraduate School 
Monterey, California, U.S.A. 



ABSTRACT 

A second order autoregressive process in exponential variables, NEAR(2) , 
is established: the distributional assumptions involved in this model high- 

light a very broad four parameter structure which combines five exponential 
random variables into a sixth exponential random variable. The dependency 
structure of the NEAR(2) process beyond and including autocorrelations is 
explored using some new ideas on residual analysis for non-normal processes 
with autoregressive correlation structure. Other applications of the expo- 
nential structure are considered briefly. These include exponential time 
series with negative correlation and exponential time series with mixed 
autoregressive-moving average structure. An application to the analysis of 
a set of wind speed data is included. 



January 1983 



1. Introduction 



There are several aspects of many observed univariate time series which 
are not satisfactorily accounted for in standard time series analysis: they 

include non-Gaussian marginal distributions, dependence beyond second-order 
moments (autocorrelations) and directionality in the time series. Quite 
often a Gaussian distribution will be inappropriate because the variable 
being modelled has a positive and highly skewed distribution, i.e., the 
service times in a queue or daily flows in a river. Many particular such 
distributions can be envisaged and time series models have been constructed 
for them. Examples are Gamma distributions (Gaver and Lewis, 1980; Lewis, 

1981; McKenzie, 1982; Lawrance, 1982) and mixed exponential distributions 
(Gaver and Lewis, 1980; Lawrance, 1980a; Lawrance and Lewis, 1982). 

However the simplest, most widely used and most tractable analytically 
of these distribution models is the exponential distribution. Like Gaussian 
random variables, exponentially distributed random variables enjoy many 
special properties; also they can be mildly transformed quite easily into 
distributions which are more skewed, or less skewed, than the exponential. The 
Weibull distribution is an example, being just a power transformation of an 
exponentially distributed random variable. Thus the approach here, following 
earlier work (Gaver and Lewis, 1980; Lawrance and Lewis, 1980, 1981) is to 
regard the exponential variables as canonical and to develop their use in 
time series modelling. It should also be noted that time series of uniformly 
distributed random variables are obtained by exponential transformations of 
the exponential time series; this has many possibilities and the process 

of uniformly distributed random variables will be considered elsewhere. 

The work cited above has concentrated for the most part on first-order, 
non-Gaussian autoregressive models, both of the standard type (constant 
coefficient additive linear combinations) and a random coefficient type 
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introduced by the authors. The extension of the models to higher order 
autoregression is clearly necessary to attain flexibility in modelling cor- 
relation and dependency structure of the processes, but these extensions are 
in no way as immediate as in the standard linear Gaussian case. A simple 
mixing device can be used (Jacobs and Lewis, 1983) but the range of correla- 
tions attained is much narrower than the range attained in the standard 
linear, second-order autoregressive structure. A broader extension, called 
the EAR(2) model, was obtained in the exponential case by Lawrance and Lewis 
(1980), but its innovation variable has a zero component and this will often 
be hard to justify. 

A major part of the present work consists of obtaining a very broad and 
rich extension of the NEAR(l) model (Lawrance and Lewis, 1981) to a second- 
order autoregressive process; this NEAR(2) model was first proposed in 
Lawrance (1980b) and later reviewed in Raftery (1981), although necessary 
analysis of its innovation structure was not given. The innovation random 
variable for the NEAR(2) process is proved here to exist without unnatural 
boundaries on its (four) parameter region and does not have a zero component. 
Taken out of the context of second-order autoregressive exponential time 
series this result gives a very broad structure for combination of exponential 
random variables. In fact it gives a way to combine three (possibly dependent) 
exponential random variables with an independent pair of (possibly dependent) 
exponential random variables to give another exponential random variable. The 
result has a number of uses beyond the second-order time series model giving, 
for instance, a first-order model which is broader than the NEAR(l) model, and 
several second-order mixed autoregressive-moving average models. Schemes for 
obtaining negative correlation can also be accommodated. These models, as in 
the first-order case, include aspects of dependence beyond autocorrelation and 
also directionality in the time series. 
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The richness of the four-parameter NEAR(2) model and the fact that an 

infinite number of cases of the model with identical correlation structure 

are available forces consideration of higher-order aspects of dependence. 

The analysis of the higher order aspects of the exponential time series is 

at a fairly early stage and is as follows. First it will be shown that the 

autocorrelations p ( Jt) , £ = 0, + 1, +2,... for the NEAR(2) process satisfy 

the Yule-Walker equations with constants a-j and a 2 which are functions 

of the four parameters of the model. This follows immediately from the fact 

that X„ is a random-coefficient, linear additive combination of X , . 
n n-1 ’ 

X n _2 and the innovation r.v. E n . Secondly, it can be shown (Lewis and 
Lawrance, 1983a) that the residuals X n - X n _ 1 - a 2 X n -2 > which are the 
usual residuals for second-order constant coefficient, linear additive 
autoregressive processes are uncorrelated . 

Thus although the standard analysis of time series stops with uncorrelated 
residuals, i.e. a flat spectrum for the residuals, such residuals can also be 
used to good effect to investigate higher order aspects of dependence in the 
NEAR(2) model. In fact, if the autoregression is not of the standard type 
(constant coefficient, linear additive) the (uncorrelated) residuals will 
have non-random scatters and higher order dependence properties. In partic- 
ular, the squared residuals will have non-zero autocorrelations and the 
cross-correlations of residuals and squared residuals except in the zero- 
lag case will be non-zero; both sets of correlations are effectively zero 
when a standard second-order autoregressive model is appropriate. 

An extension of this residual analysis is suggested by the fact that 
autocorrelations are time reversible, while the process itself is not time 
reversible, i.e. is directional. Thus one constructs reversed residuals 
X n ~ a l^n+l “ a 2^n+2 are to be usec * to assess directionality. Analysis 
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of both types of residuals can be fruitful; because of space limitations we 
present here only an investigation of cross-correlations of residuals and 
squared-residuals. This particular case of the possible residual analyses 
is illustrated by some theoretical calculations, some simulations and a brief 
application to a transformed series of wind speed data. The complete residual 
analysis including directional aspects and their investigation using the 
additional tool of reversed residuals will be given elsewhere. 
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2. Exponential Time Series Models 

Our aim in this section is to give in outline the ideas leading to the 
time series model of main concern in this paper, and called NEAR(2) , follow- 
ing the earlier terminology NEAR(l). The NEAR(2) model has four-parameters, 
and incorporates and broadens the earlier two-parameter EAR(2) model (Lawrance 
and Lewis, 1980). The NEAR(2) model will be exponential in marginal distri- 
bution, have second-order autoregressive Markov dependence, and have auto- 
correlations satisfying second-order difference equations of the familiar 
Yule-Walker type. In addition it will have dependence beyond autocorrelation, 
and will have adjustable directionality. It is not linear in the standard 
sense, having random-coefficient, linear additive autoregressive structure, 
but neither is it non-linear in the standard sense of incorporating powers or 
products of lagged variables. 

Writing {X^} for the time series variables, and { E n } for an i.i.d. 
exponential innovation sequence of unit mean, the two-parameter NEAR(l) model, 
as previously defined, is given by 



CD 

X 

3 
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w.p. 
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E 
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w. p. 


p > 


X = { 




+ - 








n 1 












1° 


w.p. 


1 -a 


■ bE n 


w. p. 


1-p , 



( 2 . 1 ) 



with b = ( 1 -a) B and p = (1 -b)/{ 1 - (l-a)B} . The parameter region is in 

general 0 < o, M 1, a = B M . The case B = 1 , 0 _< a < 1 is rather 

special, and has been called the TEAR(l) model, and when a = 1 , 0 _< B < 1 , 

the earlier EAR(l) model is recovered. Except for this EAR(l) case, the 

NEAR ( 1 ) model does not allow zero innovations (Gayer and Lewis, 1980) and so 

is more statistically acceptable . The zero innovation implies that 

X = eX and thus B can be determined exactly from runs down in the 
n n- 1 

sample path of the process. 

In general the i.i.d. innovation in the NEAR(l) process are formed as 
the probabilistic mixture of two exponentials, and are thus easily simulated. 
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The NEAR(2) model is a direct generalization of (2.1) and takes the form 





+ E 



n 



( 2 . 2 ) 



0 



W. p. 1 -a-| - (*2 



with parameter region 0 < , S 2 £ 1 , a ] >_ 0, a 2 > 0, + c* 2 < 1 ; (E n > is 



an appropriately chosen innovation sequence. Many special cases can arise 
when the above restrictions include some of the equalities and, for the pur- 
poses of a general development, it is best to regard the inequalities as 
strict. Given that (X n } is required to have an exponential marginal distri- 
bution, the main question concerns whether there is a valid probability 
distribution for E n . The Theorem proved in Section 2.3 will show that 
this is the case, and that the distribution, when the inequalities on 
a l t <* 2 and 3 ^, ^ -j n the parameter region are strict, takes the form 



a probabilistic mixture of three exponentials. To establish this result a 
fairly detailed analysis of a derived moment generating function is required. 
This is necessary since a direct moment generating function solution of (2.2) 
for E n does not establish that E n has a proper distribution; all that is 
shown is that the solution is a possibly improper mixture of three exponentials. 




(2.3) 
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3. Validity of the NEAR(2) Model 

In this section we prove the following 
THEOREM. Let { E n > be an i.i.d. sequence of unit mean exponential random 
variables. Then if the four parameters , a 2> , e 2 satisfy 

a-j > 0, a 2 > o, a.| + < 1 , 0 < ^ , 8^ < 1 , the relationship 



where 





Vn-i 


w.p. 


° 1 ’ 
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3 2 X n-2 


w.p. 


a 2 5 


’ + E n , n = 0, + 1, + 2,..., 


(3.1) 
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w.p. 
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E n = ' 


b 2 E n 


w.p. 


P 2 » 




(3.2) 




b 3 E n 


w.p. 


p 3 ’ 







defines a stationary sequence of (marginally) exponentially distributed 
random variables with mean one. Here 

P 2 = {(a 1 3 1 + a 2 e 2 )b 2 - (a-, + a 2 )R 1 R 2 }/{(b 2 - b 3 )(l - b 2 )} , (3.3) 

P 3 — {(ot-j ^ a 2 ) 3i r 2 - ( ot 1 3 1 ^ 2 ^ 2 )bg )/ ^ ( b 2 - b^) (1 - b 3 ) } , (3.4) 

and 

0 < b 3 = 1 {s - (s 2 - 4r) 1/2 j < b 2 = \ |s + (s 2 - 4r)J < 1 , (3.5) 

where 

S = (1 - a-|)3-| + (1 - a 2 )3 2 , (3.6) 

and 

r = (1 - a-| - a 2 )g^6 2 . (3.7) 
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PROOF. For the NEAR(2) model specified by (3.1) - (3.7), let <|> Y (t) and 

A 

<J>^(t) be the moment generating functions of the { X n > and (E n } sequences; 
then quite simply if stationarity of the {X n > series is assumed, 

<(>^(t) = <(>g( t) {a-| <f>^( 3 1 1 ) + + (1 _ a i - c^)) • (3.8) 

Assuming an exponential marginal distribution of unit mean for {X n > , then 
the independent distribution of {E n > has moment generating function, 
possibly not proper, given by 

(l+8it)(l+e 9 t) 

<M t) = ~y . (3.9) 

(l+t)[(l-a-| - o^)^^ + {( 1 ~ot-j ) 6-j + (l-c^^it + ^ 

First we remark that (3.9) simplifies in special cases, such as when one of 
3] » = 1 > 3-j = or a-j + a 2 = 1 . For simplicity in the development, 

the parameter region considered in the theorem is that defined by 0 < 6-j , 

02 < 1 » <*i > «2 > 0, a, + a 2 < 1 . Interesting special cases are more easily 
treated separately; these include NEAR(l) models and particularly tractable 
types of NEAR(2) models. 

It is convenient to establish right away that the quadratic term in the 
denominator of (3.9) has real distinct and positive roots, b-| and b 2 ; this 
eliminates any subsequent need to invert such a term as a whole. The required 
condition for real distinct roots is that 

9 

[ ( 1 — ot 1 )^i + ( 1 - 02 ) ~ _ a 2^i®2 

be positive: this is so from its equality to the expression 
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is obvious 



2 

[(l-a 1 )6 1 - (l-a 2 )e 2 ] + 40^23^2 

which is clearly positive; the positivity of the roots b 1 and b 2 
from (3.9) since their product and sum given in (3.11) and (3.12) below are 
both positive. 

Subject to the above qualifications concerning b-j and b 2 , a partial 
fraction expansion of (3.2) can be written in the suggestive form 

*E ( t) = Cl-P 2 -P 3 ) ^ + P 2 + P 3 y+FJt • (3 - 10) 

Comparisons between (3.9) and (3.10) then show that b^ and P 2 , p-^ may 
be obtained in terms of ^ anc * “i » « 2 by solving the equations 



t>2 + C 1 ) &i "t ( "I *^2 ) ^2 ’ (3.11) 

^2^*3 — ®i ** <* 2^1 ^2 ’ (3.12) 

( 1 -b 2 ) P 2 + O-b-jJPg = ct-j g-| + » (3.13) 

b 3 ( 1 — b 2 ) P 2 ^ 2 (^ "b3)P3 — (®i ^ ^1 ^2 * (3.14) 



A difficulty with this apparently straightforward solution is that the inversion 
of (3.9) or (3.10) could lead to a function which is not a probability density, 
or it could yield a probability density but not one which is a probabilistic 
mixture of three exponentials. In fact, neither of these possibilities is 
the case, as will be shown by establishing that p 2 and P 3 are positive 
and subject to the condition P 2 + P 3 < 1 » and hence can represent 
probabilities. 
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Explicit expressions for p 2 and p 3 can be obtained from (3.13) and 
(3.14) and are given at (3.3) and (3.4). From now on it will be assumed, 
in accordance with the theorem, that b 2 is the larger of b 2 and b 3 , 
these being obtained by solving the quadratic pair (3.11) and (3.12). To 
establish that p 2 + P 3 < 1 , we have, by adding (3.3) and (3.4), 

( $1 $ 2 ) “ ( o &2 ) ^2 

p 2 + p 3 = (l-b 2 )(l-b 3 ) • { 

Multiplying out (l-b 2 )(l-b 3 ) in the denominator and using (3.11) and (3.12) 
gives, after some rearrangement. 



The algebraic expression here is clearly positive and less than one, from 
which it follows that p-j + p 2 < 1 . 

The positivity of p 2 and p 3 will now be proved by showing that the 
numerators and denominators of (3.3) and (3.4) are positive. For the 
denominators, this requires that 0 < b 2 , b 3 < 1 which will be verified 
by showing that 0 < b 2 b 3 < 1 and 0 < (l-b 2 )(l-b 3 ) < 1 . The first of 
these latter two inequalities is obvious from (3.12); for the second consider 
the expressions 



P 2 + P 3 ^ (l-6i )(1- 3 2 ) + a]8i(l-&2) + a 2^2^"^l^ 



(l-6 1 )(l-6 2 ) 



(3.16) 



(l-b 2 )(l-b 3 ) = 1 - (b 2 + b 3 ) + b 2 b 3 

— ( ct -j 3 1 1-3-j ) (<*2^2 "t 1 — - )(o t 2^2^ 



(3.17) 



after using (3.11) and (3.12), and then 
1 - (l-b 2 )(l-b 3 ) 




- ( 1 -a i ) B-j ( 1 - ( 1 ~ a 2 )®2^ - ®l ^ ^1^2 * 



(3.18) 
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Both the right hand sides of (3.17) and (3.18) are obviously positive. This 
concludes the proof that 0 < b-j , < 1 and hence that the denominators of 

P 2 and p 3 are positive. 

For the numerators of p 2 and p 3 to be positive (3.3) and (3.4) 
indicate that b = (a^ + a 2 )gie 2 /(a-jg-j + a 2 6 2 ) must satisfy the inequalities 

b 3 < b < b 2 . (3.19) 

At this last stage, explicity expressions for b 2 and b 3 must be used, 
and from (3.11) and (3.12) are given, after writing 

s = ( 1 — ot ■, )f$i + (l-a 2 )e 2 and r = (l-o^-o^)^ e 2 , 
by 

b 2 = -Jrjs + (s 2 -4r) 1>/2 | and b 3 = ^-js - (s 2 -4r) 1/2 |. (3.20) 

Then (3.19) is equivalent to 

- (s 2 -4r) 1/2 < s - 2b < (s 2 -4r) 1/2 
or s 2 -4r > (s-2b) 2 

or sb - b 2 - r > 0 . (3.21 ) 

After some algebraic rearrangement the left hand side of (3.21) becomes 

ot l°‘ 2 ^l ^2 ( ^1 ~^ 2 ) 2 / (“l ^l +ot 2^2^ (3.22) 

which is again clearly strictly positive, excluding special cases, as 
was to be proved. 
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This concludes the proof that and P3 are both positive subject to 
P2 + P3 < 1 ; hence 1^ - P3 » P2 > and P3 can all be regarded as prob- 
abilities. Thus E n has a proper probability distribution which can be 
generated as the (I-P2 - P3> P£» P3) mixture of three exponentials of 
means 1, b2 and b^ respectively; further, both b2 and b^ are less than 
unity and b2 f b^ . 

In the special cases mentioned earlier, there are valid and simpler 
results for the distribution of . For instance, when ^ ^ = 1 > 

E n has a simple exponential distribution of mean (1-a-j - c^) . When 
^1 ~ ^ ^ the innovation has a mixed exponential distribution of the 

NEAR(l) form given in (2.1) with a = a-j + . Delineation of all these 

special cases is needed for successful simulation of the NEAR(2) process 
for the full parameter range. This will be considered elsewhere. 
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4. Other Uses of the NEAR(2) Exponential Construction 

The NEAR(2) process was established by showing that (3.2) was a valid 
innovation distribution for the relation (3.1). The distributional assump- 
tions implied by this result can also be taken out of the time series context in 
which they were derived and viewed generally as a way to combine a pair of 
(possibly dependent) unit exponential variables (Lp L 2 ) with an independent 
triple of (possibly dependent) unit exponential variables (Mp M 2 , M^) so 
as to yield a further unit exponential variable. Specifically, with 
(ctp a and ( b 2’ b 3’ P 2’ P 3^ aS P reviousl y related by (3.3) - (3.7), 



the Theorem has 


established that 












6 1 L 1 


w.p. 


a l 




■ h i 


w.p. 


1-p-J - P 2 




6 2 L 2 


w. p. 


a 2 


■ + « 


b 2 M 2 


w. p. 


P 2 


(4.1) 
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w.p. 


1 *“‘ 0 t 1 “ ot 2 
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b 3 M 3 


w. p. 


P 3 





has a unit exponential distribution. 

First of all, the idea of "switching" will be illustrated; in the NEAR(2) 
context, this suggests taking (Mp M 2 , M^) as (X ^ , X n _ 2 , X n _ 2 ) and 



(Lp L 2 ) as (E n , E n ) . 


Then 


(4.1) gives 


the 


time 


series model 
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w.p. a-j 
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b 2 X n-2 


w.p. 
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3 9 e 
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W.p. ct ^ 


(4.2) 




b 3 X n-3 


w. p. 


P 3 




0 


W. p. 1 — a 1 - (*2 • 





This is a third-order autoregression, actually a case of the EAR(3) model 
cited in Lawrance and Lewis (1979); note, however, that this third-order 
autoregressive exponential process allows zero innovations. Another, better 
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behaved higher-order exponential model -in fact a p-th order model- is 
obtained by the following application of the result ( 4 . 1 ) in its original 
form ( 3 . 1 ). Let the indices l, 2 ,...,p be partitioned into two non-empty 
sets I-| and of size t-j and t 2 respectively. Then in the model 



X 



n 



¥n-l 


w.p. 


vl. 


• • • 

w.p. 


0 


w.p. 




n = 0, + 1 , + 2,.. . , 



( 4 . 3 ) 



let e' = 3,, i6I,; = Bo, i € I ^ \ a! = a, and 7 a \ = a 0 . Then 

i€l. 1 ' 16I 2 ' * 

if a ] + a 2 < 1 » 0 < 3 -| » 3 2 < 1 the distribution of E n is given by the 

Theorem. Thus we have a pth-order exponential autoregressive process with 

four-parameters. However, while this may seem satisfying it is not clear 

that four parameters would be sufficient to characterize the sample path 

behaviour of an exponential process with very high order dependence. 

Another use of ( 4 . 1 ) is to allow L n and L 0 to both be X , and 

I 2 n- 1 

so obtain a four parameter first-order model of the form 





Vn-1 


w.p. 


a l 
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w.p. 


l-p 2 - p 3 . 
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II 


e 2 X n-l 


w.p. 


a 2 


+ < 


b 2 E n 


w.p. 


P 2 9 


( 4 . 4 ) 
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w.p. 


1 ”“0t -j — ■ &2 




¥n 


w.p. 


p 3 • 





Four parameters seems excessive for a first-order autoregressive process but 
simulations show a wide range of behaviour in sample paths with different 
choices of parameters. Equation ( 4 . 4 ) in turn suggests a first-order model 
allowing negative dependence. This is obtained by letting the variable in 
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(4.4) which is multiplied by be the antithetic transformation of X , , 
-X i 1 n " ] 

that is - log(l-e ) . A two parameter version of the model could be 

obtained, for example, taking a i = a 2 ’ e l = e 2 ' 

A third type of use of the construction is to give mixed autoregressive 
moving average models; for this, (L-j , l_ 2 ) is (X n ^ , X n 2 ) as previously, 
but (M-j, M 2 , M^) are chosen to be (E^, E^ , E ) for a second-order moving 
average compoent, or as (E^, E , E^ +2 ) for a third-order moving average 
component. 

Out of the time series context, the construction suggests ways to obtain 
multivariate exponential distributions, rather as in Lawrance and Lewis 
(1983b). 

Further possibilities are numerous, but it is not the intention here to 
exhaustively list them, or to derive the details of those cited at this time. 
Analysis in the following sections will deal with the basic NEAR(2) model. 
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5. Autocorrelation Structure of the NEAR(2) process 



In this section we show that the autocorrelations p(&) = corr(X n ,X n £ ), 
Si = 0, + 1, + 2,... of the NEAR(2) process satisfy Yule-Walker type dif- 
ference equations; thus the second moment dependency aspects of the process 
are indistinguishable from those of a standard autoregressive model, AR(2). 

To show this it is convenient to write the equation (3.1) as an additive 
autoregressive combination of X n _-|, X n _2 and E n . Thus we have a random 
coefficient, linear additive autoregressive process 





X n = Wn-1 


+ e 2 K n 


x 9 + 1 

n-2 






r 

1 w.p. 


l-p 2 


00 

CL 

1 


where 


L n = 1 


b 2 w.p. 


P2 


5 






b^ w.p. 


P3 


> 






O.o) 


w.p. 


0^1 > 


{K n’ 


II 

V = l 


(0,1) 


w.p. 


a 2 5 






(0,0) 


w.p. 


1 “06 1 “ 


and the 


i . i . d . 


sequences {L n } 


and (K 



n - 0, + 1» i 2 , . . . , 



n -0, + 1, +2,..., 



n - 0, + 1, + 2,... 



(5.1) 



(5.2) 



(5.3) 



n n j r 

independent of the independent exponential sequence { E n > . The E p ' s are 

assumed to have mean and variance 1 , as do the X n 's by construction. 

I I 

Now E(K n ) = a-j and E(K n ) = , so that E(l_ n ) = 1-a^P-j - . 

Then multiplying X n in (5.1) by X n _^ we have, for £ > 1 , 



E ^ X n X n-£^ " a l e l E ( X n-l X n-£^ + a 2 3 2 E ^ X n-2 X n-£^ + E ^ L n^ E ^ E n^ E ^ X n-£^ 
= a ] E( ^n_i x n _£^ + a 2 B 2 E ^ X n-2 X n-£^ + 1 " a l 6 l " a 2 B 2 * 

so that 

E ( X n X n-£^ ‘ 1 = a l e l {E ^ X n-l X n-£^ 1} + a 2 e 2 {E ^ X n-2 X n-£^ " 
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and 



p(-l) = a-|p(-(£-l ) ) + a 2 p(-{l-2)) , 



where and a 2 = . Using p(-l) = p (£), £ > i , we have 

final ly 

p U) = a-| P (£-l ) + a 2 p(£- 2 ) , l > ] , ( 5 . 4 ) 

which are the Yule-Walker equations for the AR(2) process. The conditions 
for a solution to exist (Box and Jenkins, 1970, p.58), a^ + a 2 < 1 
a l ‘ a 2 > " 1 » a 2 > " 1 are clearly satisfied when the conditions on 
“l . a 2 > f-| » 9 iven in the Theorem of Section 3 hold. We then have 

p( 1 ) = a-|/(l-a 2 ) and p( 2 ) = a-|p(l) + a 2 . ( 5 . 5 ) 

Note , however , the restriction to positive correlations since a-j and a 2 

are positive . The possible region of (p(l),p(2)) values is bounded below by 
o 

p(2) = p (1) and otherwise bounded by p(l) > 0 and p(2) £ 1 . Broadening 
of the model to negative dependency may be achieved using antithetic ideas, 
or the bivariate scheme given in Gaver and Lewis (1980), but is not pursued 
here. 

Note too that the parameters in (5.4) enter only as products a-j = <x-| B-j 
and a 2 = a 2 B 2 . Thus for small enough 0 ^ and a 2 , values of 6 -j and b 2 

greater than one could be allowed, and (5.4) would still have a stable 

solution. However the sequence E p in the defining equation (3.2) may not 

exist; it is not known if B-j <_ 1 and 62 1 1 is a necessary condition for 

this existence. 

Solutions to (5.4) are given in detail in Box and Jenkins (1970, 
pp. 58-60). 
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Specifying allowable values of p(l) and p(2), as may be done by an 
initial second order analysis of data, leaves two parameters to be specified 
in the model, say a-j and which could produce very different sample 
path behaviour in the time series. It is important to notice that this 
specification of p(l) and p(2) further constrains the range of possible 
a-j and c *2 values. Recalling that p(l) and p(2) fix a-j = a-jB-j and 
a2 = a2^2 ’ as we ^ as that a i + a 2 — 1 * it is easily shown that we must 
have 

a 1 £ a i ancl a 2 £ a 2 (5.6) 

which implies that a-j + a2 £ a-j + £ 1 • ™ us a-| and 0^ are forced 

to lie in a triangular subregion of the triangular (a-po^) region which is 
bounded below by a 2 , bounded on the left by a-j , and bounded above by 
the line a-j + = 1 . 
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6. An analysis of wind velocity data 
6.1. Discussion of the data 

Lewis and Hugus (1983) have given an analysis of a set of 3-hourly 
wind velocity readings taken by ship PAPA in the Gulf of Alaska over a 15 
year period. After suitable detrending to remove 1 year, 6 month, 12 hour 
and 6 hour cyclic trend components a first-order autoregressive Gamma model 
(Lewis, 1981) was fitted to the data, the use of the model being suggested 
by the shape of the (marginal) histogram of the data (Figure 6.1) and the shape 
of the normalized log periodogram of the data (Figure 6.2). Note that the 
sample size is N = 43,800; also there is a residual 6-hour effect (P = 21900) 
because this cycle varies in intensity over the 15 years. In what follows 
this will be ignored and the data will be treated as stationary. 

It is not the object here to discuss the above analysis in detail but 
to discuss a different analysis of the data using an assumption of a Weibull 
marginal distribution and a transformation to exponential variables. This 
is suggested firstly by the fact that a Weibull distribution is commonly used 
for this type of data by meteorologists and secondly by the fact that Weibull 

and Gamma distributions fit the data equally well (Lewis and Hugus, 1983). 

* 2 2 05 

The histogram of the transformed data, X n = X n * , is shown in Figure 6,3, 
where the power transformation to exponential ity has been determined by fitting 
the empirical coefficient of variation, 0.479, to the theoretical Weibull 
coefficient of variation, C(X) = {r(2/c + l)/[r(l/c + l)] 2 } - 1 to give 
c = 2.205. This transformation does affect the correlation structure of the 
data, as shown in Table 6.1. 

Column 2 in Table 6.1 gives the estimated auto correlations, p(t) , 
of the detrended data; rough 95% confidence bands for these estimates are 
given by adding and subtracting 2/(N)^ 2 = 2/(43, 800)^ 2 ~ .01 . The first 
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Figure 6.1. Histogram of detrended wind velocity data. Positive. 

skewness is evident, suggesting either a Gamma or 
V/eibull model for the marginal distribution. Sample 
size N = 43,800. 
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Figure 6.2. Log periodogram of 3-hourly detrended wind velocity 
data. Note the slight presence of the 6-hours cycle 
(P = 21900). The periodogram clearly shows that the 
data is serially correlated. Sample size N = 43,800. 
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Figure 6.3. Histogram of transformed, detrended wind velocity data. 

The transformation is X' = where the X , 

n n n 

n = 1,..., 43, 800 are the original detrended data. The 
shape is clearly exponential, but the estimated skewness 
and kurtosis have values greater than the theoretical 



FIT OF WIND SPEED DATA AUTO-CORRELATIONS 
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since tne aDsoiute values or the dirrerences are large compared to the approximate standan 
deviations of the estimated auto-correlations, 1//N = 1//43.800 ; .005. An adequate 
fit is obtained (Columns 6, 7 and 8) to a model with AR(2) auto-correlation structure. 



column gives the fitted auto correlations for a model with AR(l)-type auto 
correlations, just 'p(i) = (p(l)) £ = ( . 821 4 ) £ , for lags i = 1,...,5. 

The maximum difference (3rd column) is 0.0109 and the fit is clearly good. 

Column 6 in Table 6.1 gives the estimated auto corelations, p’(ji) , 
for the transformed data; the transformation changes the serial correlations 
slightly. However, columns 5 and 7, which give the fitted AR(1) and AR(2) 
correlation models respectively, show that a model with AR(1) correlation 
structure is not adequate, but that a model with AR(2) correlation fits well. 
This fit is borne out by a periodogram analysis, which is not given here. 

Thus, a NEAR(2) model is a candidate for representing the transformed 
data, and if p ( 1 ) and p(2) are fixed at the estimated values of 0.7985 
and 0.6574, then the corresponding a-j = (a-|B-|) and = ( a 2 6 2^ ^ rom (5. 5) 
are, respectively, 0.75488 and 0.05463. There are still two degrees of free- 
dom left in fitting the model, represented by choice of parameters a-j and 
greater than an equal to 0.75488 and 0.05463 respectively, with 

a l + a 2 — 1 • 

Figure 6.4 shows the logarithm of the periodogram of the usual AR(2) 
model linear residuals, R n = - a i^n-l “ a 2^n-2* trans ^ orme d data. 

This is flat (ignoring the slight effect at period 6 hours). At this point 
it might be thought that the usual second-order autoregressive model is 
adequate. We shall however now develop an extended residual analysis for 
higher order dependence which justifies fitting the NEAR(2) model to the 
transformed data. 
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Figure 6.4. Log periodogram of linear residuals R n = - 0.75488 

X'n-1 " 0-05463 X ^_2 for the detrended and transformed 
wind velocity data . 



7. A Residual Analysis for the NEAR(2) Model 
7.1. General results 

It has already been remarked that the autocorrelations p(2.) are 
insufficient to describe the dependency structure of NEAR(2) models. A 
natural next step might be to examine higher joint moments and consider their 
associated spectra (see e.g., Priestley, 1982). The functions so obtained, e.g. 
the bispectrum, are often found to be difficult to calculate and hard to 
interpret. Rather than follow this course, it is proposed to adapt some 
ideas from a residual analysis for autoregressive models which has recently 
been suggested by the authors (Lewis and Lawrance, 1983). The thrust of 
this analysis is that the standard process of fitting and validating a linear 
autoregressive model is carried out beyond the customary final stage at which 
uncorrelated residuals are obtained (as in the previous section). The usual 
presumption is that the residuals are not only uncorrelated but also inde- 
pendent. This need not be the case, as will be exemplified for the wind ve- 
locity data. Moreover, dependent but uncorrelated residuals are obtained 
(Lewis and Lawrance, 1983) even for the NEAR(2) process. Thus the residuals 
should be subjected to further analysis in respect of their remaining de- 
pendency. Any found is then evidence that a standard linear, constant co- 
efficient second-order autoregressive model is deficient. With normally 
distributed time series data this might suggest that non-linear modelling 
should be explored. With data marginally distributed in some other identi- 
fiable manner, then the exploration of a particular type of model with 
specified marginal distribution and autocorrelation function is suggested. 

This latter course is envisaged here. 
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Higher order dependency properties of the uncorrelated residuals are 
obtained for the proposed model and compared with their data counterparts; 
this stage can be informative from both exploratory and estimation consid- 
erations, and can be thought of as part of the model -ref inement process 
common to much statistical methodology. 

It might be thought that the specific class of NEAR(2) models could be 
incorporated in a residual analysis in the standard manner. However, a 
moments reflection indicates that even after estimating parameters of (2.2), 
it will not be possible, because of the mixture involved, to write down an 
expression or recursion for the residual innovations, namely E . However, 
the corresponding autoregressive (or linearized) residual is available and 
given, as in the previous section, by 



R n ' X n ' 



a l X n-l 



" a 2 X n-2 



We now show that these are uncorrelated for the NEAR(2) process. 



(7.1) 



7.2. The residual theorem. 

The autocovariances of the residuals (7.1) are 

Cov < R n' "nn 1 ‘ Cov < X n- V. 1 ‘ a l Cov(X n-l • W * a 2 Cov(X n-2- W 

(7.2) 

- Cov(X n , R n+t ) - a,Cov(X n> R n+Jl+1 ) - a 2 Cov(X n , R„ +(+2 ) , 

since the { } process and consequently the { R n } process is stationary. 

The covariances on the right hand side are all of the same type and given by 

Cov(X n , R nH ) - Cov(X n , (X nn - a,X n+I _, - a 2 X n+J[ _ 2 )> 

(7.3) 

= {Var(X)}(pU) - a i p ( £- 1 ) - a 2 p(fc-2)), £ = 1,2,... 
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By the Yule-Walker equations (5.4) the expression in brackets is zero, and 
hence also 



Corr(R n , R n+£ ) = 0 , *=+l,+2,... (7.4) 

as was to be proved. That these residuals are uncorrelated is an immediate 
consequence of the autocorrelations following Yule-Walker equations; this 
emphasizes that this type of residuals will be uncorrelated for any model 
whose autocorrelations satisfy Yule-Walker equations. 

The analysis of the uncorrelated residuals R n ,n = 3,4,... should begin 
with scatter plots of the low lag adjacent values; any patterns or concen- 
trations will be evidence of dependency in the residuals. Many possibilities 
present themselves but only one is pursued in the following Section 8, and 
then applied in Section 9 to a continued analysis of the wind velocity data. 
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2 

8. Cross-correlation analysis of {R^} and { } 

After the satisfactory fit to data of an ordinary linear model, the 
residual, R n , should not only be uncorrelated but also independent; the 
latter is customarily investigated by seeking a flat spectrum, while for 
the independence, a flat spectrum of the squared residuals can be sought. 

As a method for probing model validity, the examination of squared residuals 
has been employed by McLeod and Li (1983), following Cranger and Andersen 
(1978); these authors suggested bilinear modelling for dependent but un- 
correlated residuals of ARMA models. It is suggested here that autocorrela- 
tions of squared residuals and cross correlations of residuals of squared 
residuals be used in the analysis of higher order dependence of the detrended 
transformed wind speed data and its modelling by the NEAR(2) process. The 
residuals for this data have already been shown to have a flat spectrum 
(Figure 6.4) while the curved plot in Figure 8.1 for the cumulative periodogram 
shows that the spectrum of the squared residuals is fan from flat. 

Theoretical investigation of the squared residuals of the NEAR(2) model 
is pursued here. Whilst the autocorrelations of the squared residuals have 
just been mentioned, for the NEAR(2) model this involves computation of 
36 terms, mostly distinct types of 4^ order moments. A simpler suggestion 

which involves only third-order moments, and is thus the next step up after 

2 

autocorrelations, is to use the cross correlations of the R n and R p sequences; 
apart from lag 0, zero values will be found for linear models. This sug- 
gestion is more tractable than the squared residual analysis and will now 
be described. Sampling properties of third-order moments are also likely 
to be less extreme than those of fourth-order moments. 

The starting point of the calculation is to note that from the definition 
of R n at (7.1) that 



29 




“O 

o 

CD 

CL 

3 

o 

-CZ 

oo 




oo 

(T3 

3 

c r 
CD 





CO 


CD 




cz > 


CL 




•»- o 


CL 




>> cz 


3 


CM CZ 


r— S- 




QZ 


C- ■ 


CD 




Qi E -C 


** 


■u co 


4-> 


00 


CZ 1 




1 — 


3 > 


O 


0 3 


o 


4- } 


3 


CD C- 




“O 


-3 o 


'O 


*r— 


1 — CO 


CD 


00 


o 




CD 


E 


03 


S~ 


. , — 


Cl. 


1 


03 O 


E 


"O 




o • 


CD 


03 


CJ 


S- 


-a cd 




03 


-CZ 


-CZ 


3 


-a -m 


CO 


c r 


CD 


• r — 


00 


CD 4— 
Cl O 


-CZ 


CD 


00 




-CZ 


1 CD 


sJ 


4-> 


“O 3 


CD 




CZ r— 


> 




**- 03 




o 


5 > 


00 


4- 




• 1 — • 




“O 


U 


00 


CD 4-> 


-CZ * r- 


CD 


“O 03 


U 4-> 


3 


CZ i — 


•r- 10 


i — 


CD 4— 


-3 *r- 


ro 


S- 


2 +-» 


> 


4-> 4-> 


03 




CD O 


CO 4-> 


E 


“O CZ 


O CO 


03 




. 




- >>00 CD 


CD 


“O r— 


JC 


O 


CD C- 


00 +-> 


“O 


E 03 


•l— 


o 


S- CD 


4— 


•r— 


O r— 


u O 


S- 


4— O 


•r— 


CD 


00 


4-> <l> 


CL 


cz 00 


00 r- 




03 *r- 


•r— *r— 


~o 




-4— > 4-> 


CD 


+-> E 


03 C 


4-> 


3 


4— > 03 


03 


CD C_ 


00 3 


i — 


JZ 4-> 


CT 


=3 


4-> U 


4-> 


E 


CD 


00 CO 


3 


4- CL 


CD CO 


CO 


O 00 


4~> • 



_• 

CO 

CD 

S~ 

3 

CD 

e 






30 






(3.5) 



E(R n R n- t > ’ E < R nV«> ' a l E ( R n X n-t-l ) ' a 2 E < R n>W, > • 



whence there is the structural form. 



Cov(R n' R n-i> = Cov(R n' X n-t> ' a l Cov < R ^- Vm> ’ a 2 Co ''< R n' x „-,. 2 > • < 8 - 6 > 

Calculation of the covariance terms in (8.6) requires the expanding out of 
2 

R n , taking expectations and expression in covariance terms. Thus 



R n X n-t ' X n X n-* * a l X n-1 X n-l + a 2 X n-2 X n-t 

- 2a l X n X n-l X n-l - 2a 2 X n X n-2 X n-« 

+ 2a l a 2 X n-l X n-2 X n - i ■ 



(8.7) 



The conversion to covariances yields 



where 



Cov(R 2 , X n-£ } = J l (£) + + a 2 J l (£_2) 

-2a^J^(y.) - 2a 2 J-^ £) 2a-j a^J2 ^ x- l ) 



^(0 = Cov(X 2 , X^) ; J 2 (0 = Cov(X n X n _ r X p J 

J 3 (£) = Cov(X n X n2 , X p _.) . 



(8.3) 



(3.9) 



We thus see the types of third-order joint moments which are involved in 
2 

the Cov(R , R „) calculation: each of these has to be obtained using two 

n n-£ 

difference equations, one for l > 0 and one for £ < 0 . 

Taking J^(£) for illustration, square each side of the NEAR(2) defin- 
ing equation (2.2) and multiply by X n _ ? . After converting to the required 
covariances, the recursion is found to be 

J-|U) = a-j 0-j J-j U-l ) + a 2 B 2 J-| (£-2) + 2( 1 -a-| - a 2 )p(£) . (8.10) 
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For negative z there is the corresponding equation 



J 1 (Jl) = a 1 J 1 (£+1) + a 2 J-|(£+2) , 8.11) 

O 

obtained by multiplying each side of (2.2) by X n _ ? and converting to 

covariances. These equations are given for illustration: there are similar 

equations for J 2 (t) and J^z) , and various special cases. Also required 

2 

is the variance of R n which involves many terms. 

The complete algorithm for computing the cross-covariances is as follows. 

Algorithm for computing cross-covariances for ft = 0, * 1, ... ±L . 

0. Input a-| $ a 2 , ^ • 

Set p(0) = 1; p ( 1 ) = a-|/(l-a 2 ); p(2) = a-^ + a 2 and, for ft = 3,...,K , 
pU) = a^p(x-l) + a 2 pU-2). Note that p(-ft) = pU) but these are 
not needed in computations. 



1. 


Set J -| 


(0) = 4; J(l) 


= l^a-jU-j+a^) + 


2 ( 1 -a i - 


-a 2 )]/(l-a 


2&2 ^ * 


2a. 


V*) 


= ^1^1 


3 £vJ i ( £ + 2 ) 


for 


£ = - 


■1, -2,... 


, - (L+4) . 


2b. 




= a-j 3-j O-j ( 1 ) 


^ 2) + 2( 


> )p ( a) 


for z = 2,3,. . .L . 


3. 


Set J 2 


(0) = J 1 (i) - 


p(i ) + l ; 


J 2 0) = 




- p(1) + 


1 . 


4a. 


0 2 U) 


= a 1 J 2 (£+l ) + 


a 2 J 2 ( £+2 ) 


for 


z = - 


1 > -2 , . . . , 


-(L-3) . 


4b. 


J 2 U) 


= ^1 ^1 ^ 


a 2 J 2 (ft-l ) 


+ 2a i + 


(1+ P (l))a 2 








- [1+p(1 ) ] + 


[1 + p(£-‘ 


1 )](l-a r 




for Z 


= 2,3,..., L . 


5. 


Set J q 


,(0) = d-| (2) - 


p(2) + 1; 


o 3 d) = 


J 2 (2)+ 


p(l ) - p (2) 


; J 3 (2) = 0 1 ( -2 ) - p(2) + 


6a. 


J 3 (ft) 


= a-|J 2 U+l ) + 


a 2 J 3 U+2) 


for 


£ = ■ 


-1, -2,... 


, - (L+2) . 


6b. 


J 3 U) 


= a 1 J 2 ( £-1 ) + 


a 2 0 i ( £-2 ) 


+ [1 +p (1 ) ] a i + 


2^2 





+ [1 +p ( ft-2 ) ]( 1 -a i -a 2 ) - (l"tp(2)) for z 3,4,5,..., L . 
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7. JU) = J-, ( s.) + a^U-1) + a|j( a- 2) - 2a 1 J 2 (&) 

- 2a2«J3( ^-) + 2a^a 2 J2( Jl_ l ) for a = - (L+2),..., (L) . 
2 

8. Cross-covariance (R n , R n _ £ ) = J(0 - a-pJU+l) - a 2 J(s,+2) 

for z = - L, . . . , L . 
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9. Further Analysis and Modelling of the Wind Velocity Data 

Dependence in the uncorrelated linear residuals for the NEAR(2) model 

has already been demonstrated (Figure 8.1). Further evidence of this is 

2 

provided by the cross-correlations of R n and R n given in Figure 9.1. 

The corresponding theoretical cross-correlations for the NEAR(2) model will 
next be presented, having been computed using the algorithm given in 
Section 8. At this point, it will be recalled, the NEAR(2) model has not 
been fitted in terms of all 4 parameter; the residuals involve the model 
parameter only through a-j = a-jB-j and a 2 = a 2 B 2 and a-pB-pC^^ have 
not been separately estimated. In the present rather exploratory analysis 
the estimation problem will be circumvented; the cross-correlations of 
(R n ,R n ) will be given for four representative sets of parameter values in 
the reduced allowable region, as constrained by (a-j a 2 >_ a 2 , a-j + a 2 <_ 1 ) , 

according to (5.6). For the detrended transformed wind speed data, 
a-| = 0.75488 at a 2 = 0.05463, and the four chosen sets are given by 

(A) a-, = 0.760, a 2 = 0.06; - 0.99326, B 2 = 0.91050 

(B) a-j = 0.925, a 2 = 0.06; B-| = 0.81608, B2 = 0.91050 

(C) = 0.760, a 2 = 0.20; = 0.99326, B 2 = 0.27315 

(D) a-| = 0.850, a 2 = 0.10; B-| = 0.88809, B 2 = 0.54636. 

2 

In Figure 9.2 the cross-correlations of (R n ,R n ) for eac h °f these 
cases are presented; there is considerable differentiation amongst the 
figures, although the zero values for all but one or two negative lags are 
evident in all four cases. This effect will be due to the directional 
nature of the process. For matching with the wind speed data. Case D 
appear the most promising, although the suggestion of a negative correlation 
at lag minus one lends a little support to Case A . Cases B and C are 
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definitely unsympathetic to the data. Hence, a choice of parameters inter- 
mediate between Cases A and D is suggested by this exploratory analysis. 

A fuller analysis would require estimates of all four parameters and compari- 
son of the resulting residual-squared residual cross-correlations with the 
estimated cross-correlations obtained from the data, as shown in Figure 9.1. 
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10. Conclusions and Further Analysis 

The very broad four parameter NEAR(2) time series model having expo- 
nential marginals and the correlation structure of an AR(2) model has been 
established; further developments will be reported elsewhere. A preliminary 
fit of the NEAR(2) model has been made to a very long series of wind speed 
data, the data having been detrended and transformed so as to have exponen- 
tially distributed marginals; utility of the suggested residual analysis in 
probing higher order dependence has been demonstrated. This residual analysis 
has been based on the cross-correlations between the residuals and the 
squared-residuals. 

An extension to this analysis using reversed residuals is possible; 
more of the higher order dependency of the NEAR(2) model would be revealed 
and this would enable further aspects of its suitability of the model for 
the wind speed data to be ascertained. 
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